Identifying Markov Blankets with Decision Tree Induction

نویسندگان

  • Lewis J. Frey
  • Douglas H. Fisher
  • Ioannis Tsamardinos
  • Constantin F. Aliferis
  • Alexander R. Statnikov
چکیده

The Markov Blanket of a target variable is the minimum conditioning set of variables that makes the target independent of all other variables. Markov Blankets inform feature selection, aid in causal discovery and serve as a basis for scalable methods of constructing Bayesian networks. This paper applies decision tree induction to the task of Markov Blanket identification. Notably, we compare (a) C5.0, a widely used algorithm for decision rule induction, (b) C5C, which post-processes C5.0’s rule set to retain the most frequently referenced variables and (c) PC, a standard method for Bayesian Network induction. C5C performs as well as or better than C5.0 and PC across a number of data sets. Our modest variation of an inexpensive, accurate, off-the-shelf induction engine mitigates the need for specialized procedures, and establishes baseline performance against which specialized algorithms can be compared. Appears in the Proceedings of the Third IEEE International Conference on Data Mining, Melbourne, FL Nov 19-22 2003. pp 59-66

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Network Induction via Local Neighborhoods

In recent years, Bayesian networks have become highly successful tool for diagnosis, analysis, and decision making in real-world domains. We present an efficient algorithm for learning Bayes networks from data. Our approach constructs Bayesian networks by first identifying each node’s Markov blankets, then connecting nodes in a maximally consistent way. In contrast to the majority of work, whic...

متن کامل

Learning Markov Blankets for Continuous or Discrete Networks via Feature Selection

Markov Blankets discovery algorithms are important for learning a Bayesian network structure. We present an argument that tree ensemble masking measures can provide an approximate Markov blanket. Then an ensemble feature selection method is used to learn Markov blankets for either discrete or continuous networks (without linear, Gaussian assumptions). We compare our algorithm in the causal stru...

متن کامل

Comparing different stopping criteria for fuzzy decision tree induction through IDFID3

Fuzzy Decision Tree (FDT) classifiers combine decision trees with approximate reasoning offered by fuzzy representation to deal with language and measurement uncertainties. When a FDT induction algorithm utilizes stopping criteria for early stopping of the tree's growth, threshold values of stopping criteria will control the number of nodes. Finding a proper threshold value for a stopping crite...

متن کامل

Joint Markov Blankets in Feature Sets Extracted from Wavelet Packet Decompositions

Since two decades, wavelet packet decompositions have been shown effective as a generic approach to feature extraction from time series and images for the prediction of a target variable. Redundancies exist between the wavelet coefficients and between the energy features that are derived from the wavelet coefficients. We assess these redundancies in wavelet packet decompositions by means of the...

متن کامل

The Markov blankets of life: autonomy, active inference and the free energy principle

This work addresses the autonomous organization of biological systems. It does so by considering the boundaries of biological systems, from individual cells to Home sapiens, in terms of the presence of Markov blankets under the active inference scheme-a corollary of the free energy principle. A Markov blanket defines the boundaries of a system in a statistical sense. Here we consider how a coll...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003